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"METHODS AND MATERIALS RELATING TO 
DNA BINDING PROTEINS" 



CROSS— REFERENCE TO RELATED APPLICATIONS 

10 This is a continuation-in-part application of 

co-pending U.S. Patent Application Serial No. 179,587, 
filed April 8, 1988. 



BACKGROUND OF THE INVENTION 



The present invention relates generally to DNA 

20 binding regulatory proteins and more particularly to DNA 
sequences encoding early growth regulatory proteins 
possessing histidine-cysteine "zinc finger" DNA binding 

25 domains, to the polypeptide products of recombinant 

expression of these DNA sequences, to peptides and poly- 
peptides whose sequences are based on amino acid 

3Q sequences deduced from these DNA sequences, to anti- 
bodies specific for such proteins and peptides, and to 
procedures for detection and quantification of such pro- 
teins and nucleic acids related thereto. 

35 Among the most significant aspects of mam- 

malian cell physiology yet to be elucidated is the 
precise manner in which growth factors (e.g., hormones, 
neurotransmitters and various developmental and 
differentiation factors) operate to effect the regula- 
tion of cell growth. The interaction of certain growth 
factors with surface receptors of resting cells appears 
to rapidly induce a cascade of biochemical events 
thought to result in nuclear activation of specific 
growth related genes, followed by ordered expression of 
other genes. Analysis of sequential activation and 
expression of genes during the transition from a resting 
state ("G 0 ") to the initial growing state ("G^') has 
been the subject of substantial research. See, gener- 



ally, Lau et al., Proc. Nat'l. head. Sci. (USA) , 84, 
1182-1186 (1987). Much of this research has involved 
analysis of the expression of known genes encoding sus- 
pected regulatory proteins (such as the proto-oncogenes , 
c-fos and c-myc) following mitogen stimulation. An 
alternative approach has involved attempts to identify 
genes activated by mitogenic stimuli through differen- 
tial screening of CDNA libraries prepared from resting 
cells following exposure to serum and specific growth 
factors. See, e.g., Lau et al., EttBO Journal , 4, 
3145-3151 (1985). See also, Cochran et al., Cell , 33 , 
939-947 (1983), relating to the cloning of gene 
sequences apparently regulated by platelet derived 
growth factor. 

Of interest to the background of the invention 
is the continuously expanding body of knowledge regard- 
ing structural components involved in the binding of 
regulatory proteins to DNA. Illustratively, the so- 
called receptor proteins are believed to bind to DNA by 
means of zinc ion stabilized secondary structural 
fingers premised on folding of continuous amino acid 
sequences showing high degrees of conservation of 
cysteines and histidines and hydrophobic residues. See, 
e.g., Gehring, TIBS , 12, 399-402 (1987). For example, a 
"zinc finger" domain or motif, present in Xenopus 
transcription factor IIIA (TF IIIA), as well as the 
Drosophila Kruppel gene product and various yeast 
proteins, involves "repeats" of about 30 amino acid 
residues wherein pairs of cysteine and histidine 
residues are coordinated around a central zinc ion and 
are thought to form finger-like structures which make 
contact with DNA. The histidine-cysteine (or "CC-HH") 
zinc finger motif, as opposed to a cysteine-cysteine 
("CC-CC") motif of steroid receptors, is reducible to a 
consensus sequence represented as C-X2_4-C-X3-F-X5-L-X 2 - 
H-X 3 -H wherein C represents cysteine, H represents 



histidine, F represents phenylalanine, L represents 
leucine and X represents any amino acid* [See, Klug et 
al. TIBS , 12, 464-469 (1987); Blumberg et al., Nature , 
328, 443-445 (1987); and Schuh et al.. Cell , 47, 
1025-1032 (1986).] 

Of particular interest to the background of 
the invention is the recent report of Chowdhury et al., 
Cell , 48, 771-778 (1987) , relating to an asserted 
"family" of genes encoding proteins having histidine- 
cysteine finger structures. These genes, designated 
"mkrl" and "mkr2" , appear to be the first such isolated 
from mammalian tissue and are not correlated to any 
early growth regulatory events. 

There continues to exist a need in the art for 
information concerning the primary structural conforma- 
tion of early growth regulatory proteins, especially DNA 
binding proteins, such as might be provided by knowledge 
of human and other mammalian DNA sequences encoding the 
same. Availability of such DNA sequences would make 
possible the application of recombinant methods to the 
large scale production of the proteins in procaryotic 
and eukaryotic host cells, as well as DNA— DNA and 
DNA-RNA hybridization procedures for the detection, 
quantification and/or isolation of nucleic acids asso- 
ciated with these and related proteins. Possession of 
such DNA-binding proteins and/or knowledge of the amino 
acid sequences of the same would allow, in turn, the 
development of monoclonal and polyclonal antibodies 
thereto (including antibodies to protein fragments or 
synthetic peptides modeled thereon) for use in immuno- 
logical methods for the detection and quantification of 
early growth regulatory proteins in fluid and tissue 
samples as well as for tissue specific delivery of sub- 
stances such as labels and therapeutic agents to cells 
expressing the proteins. In addition, DNA probes based 
on the DNA sequences for these mammalian early growth 
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regulatory proteins may be of use in detecting gene 
markers used for the diagnosis of those clinical 
5 disorders which are linked to the marker genes. 

BRIEF SUMMARY OF THE INVENTION 

k1 The present invention provides novel purified 

and isolated DNA sequences encoding mammalian early 
growth regulatory ("Egr") proteins which comprise one or 

15 more histidine-cysteine zinc finger amino acid sequences 
putatively providing DNA binding (and hence DNA replica- 
tion or transcription regulatory) capacity. In 

20 presently preferred forms, novel DNA sequences of the 
invention comprise genomic and cDNA sequences encoding 
human and mouse early growth regulatory proteins. 

25 Alternate DNA forms, such as "manufactured" DNA, 

prepared by partial or total chemical synthesis from 
nucleotides r are also within the contemplation of the 
invention. 

30 Operative association of Egr-encoding DNA 

sequences provided by the invention with homologous or 
heterologous species expression control DNA sequences, 
35 such as promoters, operators, regulators and the like, 
allows for in vivo and in vitro transcription to form 
messenger RNA which, in turn, is susceptible to transla- 
tion to provide Egr proteins in large quantities. In 
one presently preferred DNA expression system practiced 
according to the invention, Egr-encoding DNA is opera- 
tively associated with a bacteriophage T3 or T7 RNA 
promoter DNA sequence allowing for in vitro transcrip- 
tion and translation in a cell free system. Incor- 
poration of novel DNA sequences of the invention into 
procaryotic and eucaryotic host cells by standard trans- 
formation and transfection processes involving suitable 
viral and circular DNA plasmid vectors is also within 
the contemplation of the invention and is expected to 
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provide useful proteins in quantities heretofore 
unavailable from natural sources. Illustratively, frag- 
ments of DNA encoding Egr protein of the invention have 
been incorporated in plasmid vectors resulting in 
expression by transformed E.coli hosts of fusion pro- 
teins sharing immunological characteristics of Egr 
protein. Use of mammalian host cells is expected to 
provide for such post-translational modifications (e.g., 
truncation, glycosylation, and tyrosine, serine or 
15 threonine phosphorylation) as may be needed to confer 
optimal biological activity on recombinant expression 
products of the invention. 
20 Also provided by the present invention are 

novel, presumptively mitogen sensitive, DNA sequences 
involved in regulation of the transcription of Egr- 
encoding DNA, which sequences are expected to have 
utility in the efficient recombinant expression of Egr 
proteins as well as proteins encoded by other structural 
genes. In addition, the DNA sequences may be used as 
probes to detect the presence or absence of gene markers 
used for the diagnosis of clinical disorders linked to 
those gene markers. 
35 Novel polypeptide products of the invention 

include polypeptides having the primary structural con- 
formation (i.e., amino acid sequence) of Egr proteins or 
fragments thereof, as well as synthetic peptides, an 
analogs thereof, assembled to be partially or wholly 
duplicative of amino acid sequences extant in Egr 
proteins. Proteins, protein fragments, and synthetic 
peptides of the invention are expected to have 
therapeutic, diagnostic, and prognostic uses and also to 
provide the basis for preparation of monoclonal and 
polyclonal antibodies specifically immunoreactive with 
Egr proteins, as well as to provide the basis for the 
production of drugs for use as competitive inhibitors or 
potentiators of Egr. Preferred protein fragments and 
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synthetic peptides of the invention include those 
duplicating regions of Egr proteins which are not 
involved in DNA binding functions (i.e., regions other 
than the zinc fingers). Most preferred are peptides 
which share at least one continuous or discontinuous 
antigenic epitope with naturally occurring Egr pro- 
teins . 

Antibodies of the invention preferably bind 
with high immunospecif icity to Egr proteins, fragments r 
15 and peptides, preferably recognizing epitopes which are 
not common to other proteins, especially other DNA bind- 
ing proteins. 

Also provided by the present invention are 
novel procedures for the detection and/or quantification 
of Egr proteins and nucleic acids (e»g., DNA and mRNA) 
specifically associated therewith. Illustratively, anti- 
bodies of the invention may be employed in known immuno- 
logical procedures for quantitative detection of Egr 
proteins in fluid and tissue samples. Similarly, DNA 
30 sequences of the invention (particularly those having 
limited homology to other DNAs encoding DNA binding 
proteins) may be suitably labelled and employed for the 
35 quantitative detection of mRNA encoding the proteins. 
Information concerning levels of Egr mRNA may provide 
valuable insights into growth characteristics of 
cells . 

Among the multiple aspects of the present 
invention, therefore, is the provision of (a) novel 
purified and isolated Egr-encoding DNA sequences set out 
in Figures 1A, 3, and 4 as well as (b) Egr-encoding DNA 
sequences which hybridize thereto under hybridization 
conditions of the stringency equal to or greater than 
the conditions described herein and employed in the 
initial isolation of DNAs of the invention, and (c) 
synthetic or partially synthetic DNA sequences encoding 
the same, or allelic variant, or analog Egr polypeptides 
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which employ, at least in part, degenerate codons. 
Correspondingly provided are viral or circular plasmid 
DNA vectors incorporating such DNA sequences and pro- 
caryotic and eucaryotic host cells transformed or trans- 
fected with such DNA sequences and vectors as well as 
novel methods for the recombinant production of Egr 
proteins through cultured growth of such hosts and iso- 
lation from the hosts or their culture media. 

Preferred polypeptide products of the inven- 
15 tion include those wholly or partially duplicating the 
deduced sequence of the amino acid residues set out in 
Figures 1A and 3 (i.e., mouse "Egr-1" and human 
"EGR2"). Other preferred polypeptides include fusion 
proteins such as cro-e-galactosidase/Egr-1 and bovine 
growth hormone/Egr-1 . 

Presently preferred antibodies of the inven- 
tion include those raised against synthetic peptides 
partially duplicating deduced Egr amino acid sequences 
of Figures 1A and 3 (e.g., the synthetic peptides H-L-R- 
30 q-k-D-K-K-A-D-K-S-C, the first 12 amino acid residues of 
which duplicate mouse Egr-1 residues 416-427 with the 
last cysteine added for coupling to KLH; and C-G-R-K-F- 
35 A-R-S-D-E-R-K-R-H-T-K-I duplicating mouse Egr-1 residues 
399-415). The antisera against the first peptide is 
designated VPS10 and comprises a preferred antibody of 

the invention. 

As employed herein, the term "early growth 
regulatory protein" shall mean and include a mammalian 
DNA binding protein encoded by DNA whose transcription 
temporally corresponds to cellular events attending the 
Gg/Gi growth phase transition. As employed herein, 
"histidine-cysteine zinc finger amino acid sequence" 
shall mean and include the following sequence of amino 
acids C-X 2 _ 4 -C-x 3 -F-X 5 -L-X 2 -H-X 3 -H wherein C represents 
cysteine, H represents histidine, F represents phenyl- 
alanine, L represents lysine, and X represents an amino 
acid. 



Other aspects and advantages of the present 
invention will be apparent upon consideration of the 
following detailed description thereof which includes 
numerous illustrative examples of the practice of the 
invention, reference being made to the drawing wherein; 

Figure 1A provides a 3086 base nucleotide 
sequence for a mouse Egr-1 DNA clone as well as a 
deduced sequence of 533 amino acid residues for the 
protein? Figure IB provides a partial restriction map of 
Egr-1 DNA clones together with information concerning 
the position of the protein coding sequence and the 
locus of amino acids providing for histidine-cysteine 
zinc fingers? 

Figure 2 provides an amino acid sequence 
alignment of the DNA binding domain of mouse Egr-1 in 
comparison with a zinc finger consensus sequencer with 
the Dtosophila Kruppel sequence and with the "finger 2" 
sequence of Xenppus TFIIIA protein; 

Figure 3 provides a 2820 base nucleotide 
sequence fox a human EGR2 cDNA clone as well as a 
deduced sequence of 456 amino acids for the protein; 

Figure 4 provides a 1200 base nucleotide 
sequence of a mouse Egr-1 genomic clone , specifically 
illustrating the 5* non- transcribed regulatory region 
thereof comprising bases -935 through +1; and 

Figure 5 provides a restriction map and 
organization of the mouse Egr-1 genomic clone mgEgr-1.1 
and a comparison to mouse Egr-1 cDNA. 

DETAILED DESCRIPTION 

The following examples illustrate practice of 
the invention. Example 1 relates to the preparation and 
structural analysis of cDNA for mouse Egr-1. Example 2 
relates to confirmation of the presence of an Egr DNA 
sequence on human chromosome 5. Example 3 relates to 
the in. vitro transcription and translation of mouse 
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Egr-1 cDNA. Example 4 relates to production of anti- 
bodies according to the invention. Example 5 relates to 
the isolation and characterization of genomic DNA which 
encodes mouse Egr-1. Example 6 relates to the isolation 
and characterization of cDNA encoding human EGR2. 
Example 7 relates to preparation, in an E. coli host, of 
a recombinant fusion protein including a portion of the 
deduced amino acid sequence of mouse Egr-1. Example 8 
relates to use of DNA probes of the invention in the 
15 quantitative detection of EGR1 mRNA. 

These examples are for illustrative purposes 
only and are not intended in any way to limit the scope 
20 of the invention. 



10 



25 



30 



35 



EXAMPLE 1 

Preparation and Structural 
Analysis of cDNA for Mou se Egr-1 

Isolation of DNA encoding a mammalian early 
growth regulatory protein including one or more histi- 
dine-cysteine zinc finger amino acid sequences was per- 
formed substantially according to the procedures des- 
cribed in Sukhatme et al.. Oncogene Research, 1, 343-355 
(1987), the disclosures of which are specifically incor- 
porated by reference herein. 

Balb/c 3T3 cells (clone A31) from the American 
Type Culture Collection were grown to confluence in 
Dulbecco's Modified Eagle's medium (DME) supplemented 
with 10% fetal calf serum (FCS). The cells were ren- 
dered quiescent by reduction of the serum concentration 
to 0.75% for 48 hours. To induce the cells from quies- 
cence into growth phase G x , the medium was changed to 
20% FCS with cycloheximide added to a final concentra- 
tion of 10 ug/ml. 

RNA was extracted from Balb/c 3T3 cells har- 
vested three hours after induction of quiescent cells by 



20% FCS and 10 vg/ml cycloheximide . A xgtlO cDNA 
library was constructed from this mRNA according to the 
procedures of Huynh et al., DNA Cloning , Vol. 1 / 49-78 
(Glover , D. r ed., IRL Press, 1985). This library was 
screened differentially with single stranded cDNA pre- 
pared from quiescent cells and from cells exposed to 
serum and cycloheximide for 3 hours. These 32 P-labeled 
cDNA probes were prepared from poly A + RNA as described 
in St. John, et al. r Cell , 16, 443-452 (1979), except 
that 100 yCi of 32 P-dCTP (>3000 Ci/mmol), 0.02 mM cold 
dCTP and 2-5 pg of poly A + RNA was used in each reac- 
tion. The mean size of the reverse transcribed probes, 
as assessed by alkaline agarose gel electrophoresis and 
subsequent autoradiography, was about 700 bases. 
Replica filter lifts (GeneScreenPlus, NEN-DuPont) were 
prepared essentially as described by Benton et al.. 
Science , 196 , 180-192 (1977), and approximately 3 x 10 6 
cpm of 32 P-cDNA were used per filter (90 mm diameter). 
Hybridizations were carried out at 65°C in 1% SDS, 10% 
dext ran sulfate, and 1. ML NaCl for a period of 16 
hours. The filters were washed twice for twenty minutes 
each time, first at room temperature in 2 x SSC 
[Maniatis et al., Molecular Cloning , Cold Spring Harbor 
Laboratory (New York, 1982)), then at 65°C in 2 x SSC, 
1% NaDodS0 4 and finally at 65°C in 0.2 x SSC. 
Autoradiograms were prepared by exposing the blots for 
18 hours at -70°C with an intensifying screen. 

A total of 10,000 cDNA clones from the Balb/c 
3T3 xgtlO library were differentially screened. 
Seventy-eight clones were found to hybridize preferen- 
tially to single-stranded cDNA from fibroblasts stimu- 
lated for 3 hours with 20% FCS and cycloheximide as 
compared to single-stranded cDNA from quiescent cells. 
Inserts from these clones were cross-hybridized to each 
other, resulting in the sorting of forty clones into 7 
cDNA families one of which was identified as c-f os . 
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Another cDNA clone, referred to as OC68, contained a 
2.2 kb insert and was characterized further. This 
insert was subcloned into the Eco RI site of pUC13 and 
probes were generated for Northern blot analysis either 
from the insert or the corresponding pUC plasmid. 
Figure IB illustrates a partial restriction digest map 
of the OC68 clone ("R" representing restriction sites 
for Rsal) along with that of a shorter clone, OC19t. 
Two Rsal digestion fragments, derived from the 5' end of 
15 clone OC68 and each comprising approximately 130 base 

pairs, were labeled and employed to re-screen the above- 
described xgtlO cDNA library, resulting in the recovery 
of a 3.1 kb clone, designated OC3.1, shown in figure 
IB. This clone was sequenced according to the method of 
Sanger et al., Proc. Nat'l. Acad. Sci. (USA), 74, 5463 
(1977). The 3086 base pair sequence obtained is set 
forth in Figure 1A along with the deduced sequence of 
533 amino acid residues for the protein encoded, 
designated mouse "Egr-l" . 
30 Tne deduced amino acid sequence shows a single 

long open reading frame with a stop codon (TAA) at posi- 
tion 1858. The most 5', in-frame, ATG, at position 259, 
35 is flanked by sequences that fulfill the Kozak criterion 
( A NN(ATG)G) [Kozak, Hue. Acids Res. , 15, 8125-8131 
(1987)]. The sequence region upstream of this ATG is 
highly GC-rich and results in an absence of in-frame 
stop codons. The 3' untranslated region (OT) contains 
two "AT" rich regions (nucleotides 2550-2630 and 
2930-2970). Similar sequences are found in the 3' OT 
regions of several lymphokine and proto-oncogene mRNAs, 
including granulocyte macrophage colony stimulating 
factor (GM-CSF), interleukin 1, interleukin 2, inter- 
leukin 3 (IL-3), o, B, and y interferons, and c-fos, 
c-myc, and c-myb [Shaw et al., Cell, 46' 659-667 
(1986)]. These sequences may mediate selective mRNA 
degradation. The presence in the mouse Egr-l transcript 



of such regions is consistent with its short message 
half -life. Potential polyadenylation signals (AATAAA) 
are located at nucleotide positions 1865 and 3066 , as 
well as at position 3053 (AATTAA) [Wickens et al.. 
Science r 226 , 1045-1051 (1984)]. 

The deduced amino acid sequence predicts a 
polypeptide of 533 amino acids with a molecular weight 
of 56,596. Based on structural considerations, namely a 
central region containing zinc fingers (described 
below), the Egr-1 protein can be divided into three 
domains. The N-terminal portion (amino acid residues 2 
to 331) is rich in proline (14.2%) and serine (16%) 
residues with 7*9% alanines and 7.9% threonines. The C- 
terminal region (residues 417 to 533) also contains a 
very high proportion of prolines and serines (15.4 and 
26.5%, respectively) and 10.3% alanines and 11.1% 
threonines. The large number of proline residues leads 
to a secondary structure that probably lacks 
a-helices. The central portion of the Egr-1 protein 
consists of three tandem repeat units of 28-30 amino 
acids, with the first unit starting at position 332. 
Each unit conforms almost exactly to the consensus 
sequence TGX 3 JxcX 2 _ 4 CX 3 FX 5 LX 2 HX 3 H (see Figure 2>, 
diagnostic of DMA binding zinc fingers [Berg, Science , 
232 , 485-486 (1986); Brown et al. , Nature, 324 , 215 
(1986); and Brown et al., FEBS Letters , 186 , 271-274 

(1985) ]. Furthermore, the Egr-1 fingers are connected 
by "H-C links" (TGE^P^i [Scfauh et al., Cell , 47 , 
1025-1032 (1986)] found in the Xenopus TFIIIA gene 
(between fingers 1, 2, and 3), in the Drosophila Kruppel 
gap gene [Rosenberg et al», Nature , 319 , 336-339 

(1986) ], and in genes from mouse and Xenopus that cross- 
hybridize to the Kruppel (Kr) finger domains t mkrl, - 
mkr2 [Chowdhury et al-, Cell , 48, 771-778 (1987)], and 
Xfin [Altaba et al., EMBO Journal , 6, 3065-3070 

(1987) ]. The sequence similarity amongst the Egr-1 
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fingers is 50-70%, whereas the sequence similarity 
between any of the Egr-1 fingers and those present in 
TFIIIA, Kruppel, mkrl, mkr2 or Xfin is 35-40%. Outside 
of the finger domains, it is noteworthy that the Egr-1 
and Kr proteins each contain a very high proportion of 
Pro, Ala, and Ser residues [Schuh et al.. Cell, 47, 
1025-1032 (1986)]. However, there is no sequence 
similarity in these regions. Thus, Egr-1 and Kr are not 
homologous genes nor is Egr-1 related to mkrl, mkr2, 
15 Xfin, or TFIIIA. The Kr gene contains thirteen copies 
of the hexanucleotide (ACAAAA) , or its complementary 
sequence, eight of which are located within 180 bp 
downstream from the Kr TATA box and five are in the 3« 
OT region. These sequences may serve as targets for 
other DNA binding proteins or in Kr gene autoregula- 
tion. The Egr-1 cDNA also contains nine copies of the 
ACAAAA sequence or its complement. 

Following the work described above, Milbrandt 
[ Science , 238 , 797-799 (1987)], reported the isolation 
and sequence of a nerve growth factor (NGF) inducible 
cDNA (NGFI-A) from the rat pheochromocytoma PC12 line. 
A comparison of the deduced amino acid sequence of 
NGFI-A to that of mouse Egr-1 of Figure 1A reveals 98% 
sequence identity. Thus, mouse Egr-1 and rat NGFI-A are 
homologs. The putative initiation ATG chosen by 
Milbrandt corresponds to position 343 in the Figure 1A 
cDNA sequence, and is 84 nucleotides (28 amino acid 
residues) downstream of the ATG therein designated for 
translation initiation. Both ATG's have a purine at 
position -3 and a G at position +1 and the designation 
represented in Figure 1A of the more 5' ATG as the puta- 
tive start codon is based on the experience of Kozak, 
Nuc. Acids Res. , 15, 8125-8131 (1987), even though the 
more 3* ATG is surrounded by the longer Kozak consensus 
sequence ( CCG/ACCATGG ) . Translation of an in vitro 
generated RNA transcript, described infra, selects the 
more 5' ATG for initiation. 
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It is noteworthy that a major difference in 
the deduced sequences of mouse Egr-1 and rat NGFI-A 
5 resides in the sequence spanning residues 61-68 of Egr-1 
and 33-43 of NGFI-A* The former includes the sequence 
N-S-S-S-S-T-S-S while the latter includes the sequence 
N -N-S-S-S-S-S-S-S-S~S, accounting for the 3 residue 
L ° difference in length of the putative polypeptides which 
is not accounted for by the difference in designation of 
the transcript initiation signal. 

15 

EXAMPLE 2 

20 Human Chromosome Gene Ma pping 

To determine the human chromosomal 
localization of the gene corresponding to mouse Egr-1, 
25 the 0C3.1 and OC19t cDNA clones were hybridized to a 

panel of rodent x human somatic cell hybrids. Southern 
blot analysis of the hybrid panel showed concordance 
between the presence of Egr-1 sequences and human 
30 chromosome 5. In situ hybridization to normal human 

metaphase chromosomes resulted in specific labeling only 
of chromosome 5, with the largest cluster of grains at 
35 5q23-31. Specific labeling of these bands was also 

observed in hybridizations using an Egr-1 probe which 
does not contain finger sequences. 

This localization is interesting in light of 
the non-random deletions [del(5q)] in human myeloid 
disorders (acute myelogenous leukemia) (AMI*) , and 
myelodysplastic syndromes, that involve this chromosomal 
region. [Le Beau et aL f Science , 231 , 984-987 (1986); 
Dewald et al., Blood , 66, 189-197 (1985); and Van den 
Berghe et al.r Cancer Genet . Cytogenet. , X7, 189-255 
(1985)]. Fifty percent of patients with therapy related 
AML show chromosome 5 abnormalities (interstitial dele- 
tions or monosomy) and cytogenetic analysis of the dele- 
tions has revealed that one segment, consisting of bands 
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5q2 3-3i, is absent in the malignant cells of all 
patients who have aberrations of chromosome 5. These 
5 data suggest that loss of a critical DNA sequence lead- 
ing to hemi zygosity (or homozygosity) of a recessive 
allele may play an important role in the pathogenesis of 
these disorders, a mechanism substantiated for retino- 

10 blastoma. Although genes for a number of growth factors 
and receptors (IL-3, GM-CSF, B 2 -adrenergic receptor, 
endothelial cell growth factor, CSF-1, c-fms, pDGP 

15 receptor) are clustered in or near this region, Egr-1 
(by virtue of its zinc fingers) is the only member of 
this group with potential transcriptional regulatory 

20 activity, it is therefore possible that its absence 
could lead to deregulated cell growth. 



EXAMPLE 3 

25 

In Vitro Expression of Mouse Eqr-1 cDNA 
"™~~™~ a 2.1 kb Apal/Apal fragment (comprising 
30 nucleotides 120-2224 of Figure 1A) was isolated from the 
OC3.1 DNA clone. This fragment includes the translation 
start (ATG) codon at nucleotide position 259 designated 
35 in Figure 1A. The fragment was blunt-ended with T4 DNA 
polymerase and cloned- into the Bluescript vector KS 
M13(+) containing a T3/T7 bacteriophage promoter. The 
(T3) sense transcript was generated and in vitro trans- 
lated in a standard rabbit reticulocyte lysate system 
(Promega Biotec, Madison, WI. 53711) including 
35 s methionine as a radiolabel. An analogous in vitro 
transcription system was developed using a Bglll/Bglll 
fragment of OC3.1 (including nucleotides 301-1958 and 
not including the translation start designated in Figure 
1A) . The T7 sense transcript was employed in the trans- 
lation system. Differential characterization of trans- 
lation products by autoradiographic SDS PAGE indicated 
that the ATG at nucleotide position 259 is preferred as 
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a translation start codon when all potential start sites 
are present. 

EXAMPLE 4 

Preparation of Antibodies ; 

A first synthetic peptide based on the 
sequence of amino acid residues 416-427 of mouse Egr-1 
was prepared and provided with a carboxy terminal 
15 cysteine residue- The peptide , H-L-R-Q-K-D-K-K-A-D-R- 
S-C, was coupled to KLH and employed to immunize New 
Zealand white rabbits. Animals were initially immunized 
with 100 ng of the immunogen in Freund's Complete 
Adjuvant and every two weeks were boosted with 100 ug of 
immunogen with Freund's Incomplete Adjuvant. Sera, 
designated VPS10, were isolated after 68 days and 
displayed an antibody titer of 1:12 r 800 based on 
reactivity with the antigen used to prepare the anti- 
sera. 

30 A secon( j synthetic peptide, based on residues 

399 to 415 of mouse Egr-1, was prepared. The peptide, 
C-G-R-K-R-A-R-S-D-E-R-K-R-H-T-K-I , was coupled to KLH 

35 and used to immunize rabbits as above, resulting in the 
production of antisera (designated VPS2) with a titer of 
1:400. 

EXAMPLE 5 

Isolation of Genomic Mouse Egr-1 Clone and 
Characterization of Regulatory Regions 

A mouse Balb/c 3T3 genomic library was pre- 
pared in a Stratagene (La Jolla, California) vector r 
XFIX, according to the manufacturer's instructions and 
probed using 1% SDS, 1 M NaCI, and 10% dextran sulfate 
at 65°C with stringent final wash in 0.2 x SSC at 65°C 
with a 2.1 kb Apal/Apal fragment and a 3.1 kb Eco RI/Eco 
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RI fragment derived from digestion of P UC13 including 
the mouse Egr-1 clone OC3.1. One positive clone, from 
approximately 300,000 screened, was designated mgEgr-1.1 
and also hybridized to the extreme 5 '-end 120 bp Eco RI- 
Apa I fragment from plasmid OC3.1. 

A 2.4 kb Pvu-H-PvuII fragment and a 6.6 kb 
Xbal-Xbal fragment (see Figure 5) derived from the 
mgEgr-1.1 clone were subcloned into the Smal and Xbal 
sites of P UC13 and P OC18 respectively, and the resulting 
15 plasmids (designated as p2.4 and p6.6) were used for 

restriction mapping analysis of transcription initiation 
sites and for nucleotide sequencing. Marked in Figure 
20 4, and listed in Table 1, are possible regulatory 
elements identified in the 5' flanking sequence of 
mgEgr-1.1. A putative TATA motif (AAATA) is located 26 
nucleotides upstream of the transcription start site. A 
^ 5 "CCAAT" type sequence starts at nucleotide -337. Five 
different regions, each 10 nucleotides in length, 
located at -110, -342, -358, -374, and -412, are nearly 
30 identical to the inner core of the c-fos serum response 
element (Treisman, R. , Cell, 46, 567 (1986)). Each has 
a 5-6 nucleotide AT rich stretch and is surrounded by 
35 the dinucleotide CC on the 5' side and GG on the 

other. Two potential TPA responsive elements (Lee, W., 
et al.. Cell , 49, 741 (1987) and Angel, P., et al . , 
Cell , 49, 729 (1987)) are located at nucleotides -610 
and -867. Four consensus Spl (Briggs, M.R., et al . , 
Science , 234 , 47 (1986) binding sequences are at 
position -285, -649, -700 and -719. In addition, two 
sequences have been identified that might serve as cAMP 
response elements (Montimy, M.R., ft al. , Nature, 328, 
175 (1987)) (-138 and -631). 
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TABLE 1 

Location and Identification of 
Potential Regulatory Elements 



Element 

TATA 
CCAAT 



Seguence J 

AAATA 
CCAAT 



Serum Response Element 
Consensus 

GATGTCCATATTAGGACATC 
CC TA AT GG 
G C 



TCCTTCCATATTAGGGCTTC 
GTGGCCC— AATATGGCCCTG 
C AGCGC CTTATATGG AGTGG 
ACAGACCTTATTTGGGCAGC 
AAACGCCATATAAGGAGCAG 



TPA Responsive Element 
(API binding site) 
Consensus 

C C 

TGACT A 
G A 



30 Spl binding site 



CTGACTCG 
CTGACTGG 



GGGCGG 
GGGCGG 
CCGCCC 
GGGCGG 



Location* 

-26 to -22 
-337 to -333 



-110 to -91 
-342 to -324 
-358 to -339 
-374 to -355 
-412 to -393 



-610 to -603 
-867 to -860 



-285 to -280 
-649 to -644 
-700 to -695 
-719 to -714 



35 cAMP Response Element 
Consensus 

TGACGTCA 



TCACGTCA 
TGACGGCT 



-138 to -131 
-631 to -624 



1. The underlined bases in the mouse Egr-1 gene 
sequence are those that do not match the consensus 
sequence. 

2. The location numbers refer to the nucleotides of the 
mouse Egr-1 gene as indicated in Figure 4. 
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To obtain the genomic sequence and the intron- 
exon gene structure, specific oligonucleotides (17-mers 
at positions 83, 122, 174, 200, 379, 543, 611, 659, 905, 
920, 1000, 1200, 1400, 1600, 1800, 2100, 2353, 2650, 
2825) of the OC3.1 cDNA sequence (see Figure 1A) were 
used as primers for double stranded sequencing of 
plasmids p2.4 and p6.6. Comparison of the Egr-1 
genomic sequence to the Egr-1 cDNA sequence showed the 
Egr-1 gene consists of 2 exons and a single 700 bp 
15 intron (between nucleotide position 556 and 557 as 

numbered in Figure 1A and as shown in Figure 5). Both 
the 5' and 3« splice junction sequences (not shown) are 
20 in excellent agreement with the consensus boundary 

sequences. Mount, S.M., Nucleic Acids Res., 10' 459 
(1982) . 

25 EXAMPLE 6 



Isolation and Characterization 

30 of Human EGR2 cDNA 

A human genomic placental library in the 



35 



vector EMBL3, prepared by Dr. C. Westbrook of the 
Oniversity of Chicago according to procedures described 
in Frischauff et al., -Tour. Mol. Biol., 170' 827-842 
(1983), and a human leukocyte cosmid library prepared 
according to procedures described in Proc. Mat' 1. Acad. 
Sci. (USA) , 80, 5225-5229 (1983), were probed with the 
2.1 kb Apal fragment of OC3.1 (described in Example 5) 
using 1% SDS, 1 M NaCl and 10% dextrose sulfate at 
50-55°C with a non-stringent final wash in 2 x SSC at 
50-55»C. A single positive clone (designated HG6) was 
isolated from the first library and four clones 
(designated HG17,- 18, 19 and 21, respectively) were 
isolated from the second library. A 6.6 kb Sall/EcoRI 
fragment of clone HG6 was found to hybridize wi.th a 332 
base pair Hpall/Hpall fragment of the mouse Egr-1 gene, 
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which letter fragment spans the putative zinc finger 
region. The 6.6 kb fragment , in turn, was employed to 
5 probe a cDNA library derived from human fibroblasts 
which have been stimulated for three hours with 20% 
fetal calf serum in the presence of 10 ug/ml 
cyclohexamide . About 10 , 000 clones were screened and 
the fifty positive clones obtained (designated "zap-l 
through zap-50") are being subjected to nucleotide 
sequence analysis. Preliminary sequence analysis 
15 reveals that three clones , zap-2, zap-8, and zap-32, all 
encode the same transcript, namely a protein designated 
human EGR2 , shown in Figure 3. Preliminary analysis 
20 indicates approximately 92% homology between mouse Egr-1 
and human EGR2 polypeptides in the zinc finger regions , 
but substantially less homology in the amino and carboxy 
25 terminal regions- Chromosome mapping studies, similar 
to those described in Example 2, indicate that human 
chromosome 10, at bands q21-22, constitutes a locus for 
the human EGR2 gene. 
30 The plasmid zap-32, containing the full length 

human EGR2 clone, was used as a probe in Southern blot 
analysis on DNAs from 58 unrelated Caucasians- It was 
35 found that Hind III detects a simple two-allele 

polymorphism with bands at either 8.0 kb (Al) or 5.6 kb 
and 2.4 kb (A2) . No constant bands were detected. The 
frequency of Al was 0.90 and that of A2 was 0*10. No 
polymorphisms were detected for Apa I, BamH I, Ban II, 
Bgl I, Bgl II r BstE II, Dra I, EcoR I, EcoR V, Hinc II, 
Msp I, Pst I, Pvu II, Rsa I, Sac I, and Taq I in 10 
unrelated individuals. Co-dominant segregation of the 
Hind III RFLP was observed in four large kindreds with a 
total of more than 350 individuals. 

These data will be useful in gene linkage 
studies for mapping genes for certain genetic 
disorders. For example, the gene responsible for the 
dominantly inherited syndrome, multiple endocrine 
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neoplasia, type 2A (MEN-2A) has been assigned by linkage 
to chromosome 10. Simpson, et al. , Nature , 328, 528 
(1987). Studies are currently underway to determine the 
linkage relationship between MEN- 2 A and EGR2 and are 
expected to be useful in cloning the MEN- 2 A gene as well 
as in serving as a diagnostic marker for the disease. 



EXAMPLE 7 



15 Recombinant Expressi on Of Fusion Proteins 

A 322 base Hpall/Hpall fragment (comprising 
nucleotides 1231-1553) derived from the OC3.1 cloned DMA 
20 was treated with DNA polymerase to fill in the single 
stranded ends. This fragment was inserted in plasmid 
pEX3 (obtained from K. Stanley, European Molecular 
Biology Laboratory, Postfach 10.2209, 6900 Heidelberg, 
2t> F.R.G.) digested with Smal. Stanley, K.K., et al. , EMBO 
j., 3, 1429 (1984). This insertion placed the Egr-1 
encoding DNA fragment in the same reading frame as 
plasmid DNA encoding cro-s-galactosidase, allowing for 
the expression of a fusion protein comprising the amino 
terminal residues of cro-B-galactosidase and 108 resi- 
dues of Egr-1 amino acids 325 to 432. This cro-s- 
galactosidase/Egr-1 fusion plasmid, designated pFIG, was 
used to transform E. coli NF1. 

induced (42"C) and un-induced (30'C) cultured 
cell lysates from growth of the transformed NF1 cells 
were then analyzed by SDS-PAGE. Upon Coomassie stain 
analysis, only induced cell lysates included an 
approximately 108 kd product, indicating presence of the 
projected expression product. Western blot analysis, 
using the rabbit polyclonal anti-peptide antibody VPS10 
(see Example 4) raised against H-L-R-Q-K-D-K-K-A-D-K-S- 
C, confirmed that the fusion protein product contained 
Egr sequences. 
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in a separate construction, a mouse Egr-1 
insert, from plasmid 0C3.1, was fused, in frame, to a 
5 plasmid containing sequences from bovine growth hormone 
according to the methods described in. Slamon, D.J., et 
al., Science , 233 , 347 (1986). The resultant plasmid, 
designated pV4, comprised a fusion protein containing a 

10 fusion gene coding for bovine growth hormone amino acids 
1 to 192 and Egr-1 amino acids 2 to 533. This bGH/mouse 
Egr-1 DNA fusion plasmid, designated pV4 r was expressed 

15 in E. coli and the resulting fusion protein, designated 
V4, was identified in Western blots by its reactivity 
with a bGH monoclonal antibody and its reactivity with 

20 VPS10 rabbit anti-Egr-1 peptide antiserum, prepared 
according to Example 4. 

EXAMPLE 8 

25 " 

Determination of Egr 
Levels in Human Tumor and 
30 Non-Tumor Tissue 

J "* Using the mouse Egr-1 OC68 probe, Northern 

blot analyses were conducted to determine the levels of 
transcription of Egr protein encoding DNA in tumor 

35 versus surrounding normal tissue from resected human 

tumor specimens. The tumor samples were from lung (12), 
colon (7), colon mesastasis (1), bladder (1), rectal 
(1), giant cell (1), hepatoma (1), breast (1), MFH 
(malignant fibrous histiocytoma) (1), osteosarcoma (1) 
and rhabdomyosarcoma (1). In about 50% of these cases, 
there is markedly decreased (about three to ten-fold) 
expression of the Egr mRNA in tumor versus normal 
tissue. One implication of this finding is that Egr 
proteins of the invention may function as part of a 
negative regulatory pathway. In any event, it is clear 
that DNA sequences and antibodies of the invention are 
susceptible to use in differential diagnoses between 
tumorous and non-tumor ous cell types. 
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It will be apparent from consideration of the 
foregoing illustrative examples that the present inven- 
tion constitutes a substantial advance in the art and 
the achievement of a major goal in molecular biology, 
i e. , the characterization of genes which play a regu- 
latory role in mammalian cell proliferation and differ- 
entiation. It will thus be understood that the informa- 
tion provided herein constitutes a basis for straight- 
forward development of useful methods and materials not 
15 specifically the subject of the above examples. By way 
of illustration, possession of knowledge concerning the 
base sequence of cDKA and genomic DNA sequences encoding 
distinct mouse Egr-1 and human EGR2 early growth regu- 
latory proteins comprising histidine-cysteine zinc 
finger amino acid sequences makes possible the isolation 
of other such structurally related proteins. The sub- 
stantial homology between the zinc finger regions of 
Egr-1 and EGR2 coupled with lack of homology in other 
protein regions, when considered in light of the ability 
of Egr-1 probes to localize to human chromosome 5 while 
EGR2 probes localize to human chromosome 10, essentially 
assures the straightforward isolation of a human gene 
(provisionally designated "human EGR1 " ) which encodes a 
protein more closely homologous to Egr-1 and a mouse 
gene (Egr-2) encoding a protein more closely homologous 
to EGR2 . 

While the above examples provide only limited 
illustration of in vitro and in vivo expression of DNA 
sequences of the invention, known recombinant techniques 
are readily applicable to development of a variety of 
procaryotic and eucaryotic expression systems for the 
large scale production of Egr proteins and even develop- 
ment of gene therapy regimens. 

Knowledge of the specifically illustrated 
mouse Egr-1 and human EGR2 proteins of the invention has 
been demonstrated to provide a basis for preparation of 
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highly useful antibodies , also provides a wealth of 
information concerning the nature of protein-nucleic 
5 acid interactions which, in turn, constitutes a basis 

for determination of significant early growth regulatory 
events. For example, and by analogy to steroid receptor 
1Q protein structures, analysis of the structure of regions 
X ° flanking the zinc fingers of Egr-1 and EGR2 and related 
proteins of the invention is expected to allow for 
identification of substances which may interact with the 
15 proteins to alter their DNA interactive capacities and 

thus provide the basis for inhibition or augmentation of 
their regulatory functions. Moreover, information 
20 available concerning specific events of DNA interaction 
of Egr proteins of the invention will permit, e.g., 
identification and use of potential competitive 
25 inhibitors of these proteins. 

just as Egr encoding DNA of the invention is 
conspicuously susceptible to use in differentiation of 
human tumor and non-tumor cells, antibodies prepared 
30 according to the invention are expected to be useful in 
differential screening of cells based on relative 
cellular concentrations of mRNA expression products and 
35 in the determination of specific genes susceptible to 
regulation by such products- 

Because numerous modifications and variations 
in the practice of the present invention are expected to 
occur to those skilled in the art, only such limitations 
as appear in the appended claims should be placed 
thereon • 
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WHAT IS CLAIMED IS 

1. A purified and isolated DNA sequence 
encoding a mammalian early growth regulatory protein 
which comprises one or more histidine-cysteine zinc 
finger amino acid sequences. 

2. The DNA sequence according to claim 1 
encoding human EGR2 protein. 

3. The DNA sequence according to claim 1 
encoding mouse Egr-1 protein. 

4. The DNA sequence according to claim 1 
which is a cDNA sequence. 

5. The DNA sequence according to claim 1 
which is a genomic DNA sequence. 



30 6 . The DNA sequence according to claim 1 

which is a wholly or partially synthetic DNA sequence. 



7. The DNA sequence according to claim 1 
operatively associated with an homologous or 
heterologous expression control DNA sequence. 

8. The DNA sequence according to claim 1 
selected from the group consisting of the DNA sequences 
set out in Figures 1A, 3, and 4. 

9. A procaryotic or eucaryotic host cell 
transformed or transfected with a DNA sequence 
comprising a DNA sequence according to claim 1 or 7. 

10. A viral or circular DNA plasmid vector 
comprising a DNA sequence according to claim l. 
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11. The viral or circular DNA plasmid vector 
according to claim 10 further comprising an expression 
control DNA sequence operatively associated with said 
early growth regulatory protein encoding DNA. 



12. A method for the production of an early 
growth regulatory protein comprising: 

growing , in culture, a host cell transformed 
15 or transfected with a DNA sequence according to claim 1; 
and 

isolating from said host cell or culture the 
20 polypeptide product of the expression of said DNA 
sequence. 



13. A method for the production of an early 
growth regulatory protein comprising: 

disposing a DNA sequence according to claim 1 
in a cell free transcription and translation system? and 

isolating from said system the polypeptide 
product of the expression of said DNA sequence. 



35 14. The polypeptide product of the in vitro 

or in vivo expression of part or all of a protein encoding 
region of a DNA sequence according to claim 1. 

15. The polypeptide product according to 
claim 14 which is a fusion protein comprising part or 
all of a mammalian early growth regulatory protein which 
comprises one or more histidine-cysteine zinc finger 
amino acid sequences and part or all of a heterologous 
proteins. 

16. The polypeptide product according to 
claim 15 r which comprises a fusion of cro-B- 
galactosidase and Egr-1 amino acid sequences or bovine 
growth hormone and Egr-1 amino acid sequences. 
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17. A purified and isolated mammalian early 
growth regulatory protein comprising one or more histi- 
dine-cysteine zinc finger amino acid sequences. 

18. The protein according to claim 17 and 
having the amino acid sequence set out in Figure 1A. 

19. The protein according to claim 17 and 
15 having the amino acid sequence set out in Figure 3. 

20. A synthetic peptide duplicative of a 
20 sequence of amino acids present in a mammalian early 

growth regulatory protein comprising one or more histx- 
dine-cysteine zinc finger amino acid sequences and 
sharing at least one antigenic epitope of such 
protein. 

21. The synthetic peptide of claim 20 and 
30 having an amino acid sequence partially duplicative of 

the amino acid sequence set out in Figure 1A. 

22. The synthetic peptide of claim 20 and 
having an amino acid sequence partially duplicative of 
the amino acid sequence set out in Figure 3. 

23. The synthetic peptide of claim 20 and 
further characterized as duplicative of an amino acid 
sequence not involved in DNA binding functions. 

24.. An antibody specifically immunoreactive 
with at least one epitope of a mammalian early growth 
regulatory protein comprising one or more histidine- 
cysteine zinc finger amino acid sequences. 
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25. The antibody according to claim 24 
wherein said epitope is other than an epitope within the 

5 DMA binding functional region thereof. 

26. The antibody according to claim 24 
selected from the group consisting of monoclonal and 
polyclonal antibodies. 

27. The antibody according to claim 24 

15 capable of specifically binding with a proteinaceous 

material comprising an amino acid sequence duplicating 
an antigenic epitope within the following amino acid 

20 sequence: 

H-L-R-Q-K-D-K-K-A-D-K-S-C . 



25 



30 



28 . The antibody according to claim 24 
capable of specifically binding with a proteinaceous 
material comprising an amino acid sequence duplicating 
an antigenic epitope within the following amino acid 
sequence i 

C-G-R-K-F-A-R-S-D-E-R-K-R-H-T-K-I . 



35 29. A method for quantitative detection of a 

mammalian early growth regulatory protein comprising one 
or more histidine-cysteine zinc finger amino acid 
sequences based on the immunological reaction of the 
same with an antibody according to claim 24. 

30. A method for quantitative detection 
within a sample of messenger RNA transcripts for 
mammalian early growth regulatory proteins comprising 
one or more histidine-cysteine zinc finger amino acid 
sequences comprising the step of hybridizing RNA within 
said sample with a DNA sequence partially or wholly 
duplicating a DNA sequence according to claim 1. 
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31. A method for quantitative detection 
within a sample of DNA encoding for mammalian early 
growth regulatory proteins comprising one or more 
histidine-cysteine zinc finger amino acid sequences 
comprising the step of hybridizing DNA within said 
sample with a DNA sequence partially or wholly 
duplicating a DNA sequence according to claim 1. 



32. A method for detecting a disease 
15 genetically linked to a mammalian Egr gene comprising 
the step of quantitating mammalian early growth 
regulatory DNA sequences according to claim 1. 
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